Picture for Yu Gu

Yu Gu

HoliDubber: Holistic Video Dubbing for Complex Acoustic Scenes via Text-Guided Audio Synthesis

Add code
Jun 08, 2026
Viaarxiv icon

MemoryCard: Topic-Aware Multi-Modal Clue Compression for Long-Video Question Answering

Add code
Jun 04, 2026
Viaarxiv icon

Foundation VAEs for 3D CT Reconstruction, Augmentation, and Generation

Add code
May 29, 2026
Viaarxiv icon

Generative Spatiotemporal Intent Sequence Recommendation via Implicit Reasoning in Amap

Add code
May 27, 2026
Viaarxiv icon

Pause and Reflect: Conformal Aggregation for Chain-of-Thought Reasoning

Add code
May 13, 2026
Viaarxiv icon

ReAlign: Optimizing the Visual Document Retriever with Reasoning-Guided Fine-Grained Alignment

Add code
Apr 08, 2026
Viaarxiv icon

InstructTable: Improving Table Structure Recognition Through Instructions

Add code
Apr 03, 2026
Viaarxiv icon

Anticipatory Planning for Multimodal AI Agents

Add code
Mar 17, 2026
Viaarxiv icon

Tau-BNO: Brain Neural Operator for Tau Transport Model

Add code
Mar 09, 2026
Viaarxiv icon

OJBKQ: Objective-Joint Babai-Klein Quantization

Add code
Feb 09, 2026
Viaarxiv icon